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Abstract 

The  present  study  shows  by  example  the  potential  amount  of  information  available  in  a  set  of 
observations  of  targets  where  there  are  known  relations  between  these  targets.  Known  relations 
between  objects  significantly  reduces  the  set  of  possible  explanations  behind  a  set  of  observations. 

The  application  here  is  classification  of  military  targets.  Cost  functions  and  interaction  with  deci¬ 
sion  makers  extend  the  feasibility  of  the  present  approach  meaningfully  to  treat  many  observations 
and  possible  targets  behind  these. 

1  Multi-object  state- estimation 

Classification  and  tracking  of  military  targets  are  today  often  made  individually  for  each  object. 
An  approach  also  utilizing  observations  of  other  objects  can  provide  better  results.  Mahler  [1997], 
Kreinovich  [1997]  and  many  others  elaborate,  within  the  framework  of  random  sets,  on  classification 
of  many  objects  based  on  a  variety  of  data  sources.  Theory  of  random  sets  provides  a  direct  approach 
to  treat  a  variety  of  observations  of  many  targets.  This  work  also  takes  a  direct  approach  to  classify 
a  set  of  objects. 

The  focus  of  this  work  is  by  example  quantitatively  to  illustrate  the  potential  amount  of  infor¬ 
mation  available  in  the  whole  set  of  observations  of  actual  targets.  This  approach  utilizes  knowledge 
about  relations  between  possible  targets  and  their  time  varying  parameters  using  computer  based 
tools.  The  present  study  in  general  demonstrates  the  idea  of  searching  through  the  set  of  possible 
explanations  (state  space)  for  a  given  set  of  observations  for  the  purpose  of  object  tracking  and 
classification. 

This  work  consists  of  two  main  parts.  The  first  part  illustrates  the  potential  of  a  full  search 
through  a  small  state  space  (sections  2,  3  and  4).  The  second  part  discusses  search  in  state  spaces 
which  are  too  large  to  search  throughout  (section  5). 

2  Limited  example  of  information  extraction  from  a  set  of 

observations 

Appendix  A  gives  an  example  where  a  restricted  and  known  set  T  =  {a,  6,  c,  d}  of  military  targets 
are  confined  within  an  area.  An  observation  (or  report)  O  =  a  means  that  an  observer  believes  he 
has  observed  a  (instead  of  6,  c  or  d)  at  a  given  location  and  time.  Let  X  denote  the  true  origin  of 
the  observation  O.  If  X  =  a  then  the  observation  O  =  a  is  correct.  Otherwise  it  is  false. 
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Assume,  for  simplicity,  that  each  observation  has  a  given  uncertainty  as  follows:  A  single  ob¬ 
servation  is,  in  our  example,  correct  with  58  percent  probability  and  the  probability  to  give  the 
report  6,  c  or  d  is  each  14  percent  (in  total  100  percent).  So  with  standard  probabilistic  notation, 
P(0  =  a\X  =  a)  =  0.58  and  P(0  =  b\X  =  a)  =  0.14  etc.  We  assume  symmetry  with  respect  to  the 
objects,  so  P(0  =  b\X  =  b)  =  0.58  and  P(0  =  a \X  =  b)  =  0.14  etc.  Equation  1  summarizes  this 
information  where  x,y  E  T  and  1  <  i  <  4. 


P(Oi  =  x\Xt  =  y ) 


0.58  if  x  =  y 
0.14  otherwise 


(1) 


Given  the  observations  Oy.  O2,  O3  and  O4  with  real  origin  Xi,  X2,  X3  and  X4  respectively. 
Assume  that  each  set  of  possible  values  for  Xj  (i.e.  each  state)  has  the  same  initial  probability. 

Assume  we  want  to  estimate/guess  what  is  the  real  origin  X\  behind  the  observation  Oi  =  a. 
The  Bayes  rule  provides  the  probability  that  the  real  target  X.,  is  a: 


P{Xi  =  a\Oi  =  a) 


P{Oi  =  a\Xi  =  a)P{Xl  =  a) 
P{Oi  =  a) 


(2) 


where  P(Oi  =  a)  =  X)sgt  P{0%  =  a|Xj  =  s)P(Xi  =  s)  =  0.58  ■  0.25  +  3  •  0.14  •  0.25  =  0.25.  This  gives 
P(Xj  =  a\Oi  =  a)  =  0.58.  The  probability  for  Xj  =  6  similarly  is: 

P{Oi  =  a) 

which  has  numerical  value  0.14.  Each  estimate  above  only  depends  on  one  single  observation. 

If  the  observations  are  related,  the  observations  O2,  O3  and  O4  contain  information  about  X\. 
I.e.  P{X\\0\  =  a,  02,03,04)  is  in  general  different  from  P(X\\0\  =  a).  We  may,  for  example,  be 
sure  that  all  observations  have  different  objects  as  origin.  This  can  be  the  case  if  the  observations 
for  example  are  simultaneous  or  the  objects  are  geographically  far  from  each  other.  The  real  targets 
(objects)  are  in  this  case  all  different  (provided  reliable  position  estimates  for  the  observations). 
Spatial  location  is  just  one  special  example  of  a  temporal  parameter  for  an  object,  so  this  situation 
is  much  more  general  than  for  simultaneous  observations  of  geographically  distributed  targets  or 
where  the  targets  cannot  move  quickly  enough  to  be  observed  several  times.  It  can  often  be  easy  to 
find  out  that  two  objects,  X\  and  X2,  are  different  even  if  it  is  hard  exactly  to  classify  the  objects. 
For  example,  one  may  easily  see  that  there  are  two  different  people  on  two  different  unclear  photos. 
However,  it  is  in  general  much  harder  to  identify  the  people  more  exactly.  One  may  only  observe 
that  one  person  is  higher  than  the  other  or  have  different  clothes  the  same  afternoon.  One  may  have 
a  cellular  phone  while  the  other  has  not  etc. 

One  may  in  general  assume  a  known  (general)  relation  -<  on  the  set  of  possible  targets  and  that 
this  is  observable.  See  the  illustration  of  Figure  1.  The  actual  relations  may  be  static  (given  for  a 
set  of  objects)  or  it  can  be  situation  dependent.  The  relations  must  in  all  cases,  to  be  utilized,  be 
part  of  (or  given  by)  the  set  of  observations. 

Assume,  for  example,  that  for  any  possible  target  Xj  and  Xj  we  know  if  X%  -<  Xj  is  true  or  not. 
-<  may  simply  mean  ’is  larger /higher /longer  than’,  ’is  different  from’,  ’in  front  of’,  ’earlier  than’, 
’faster  than’  or  it  can  be  based  on  more  complex  geometric  aspect  ratios,  properties  of  radar  return, 
emittance  etc.  Xj  -<  Xj  may  also  mean  for  example  that  ’Xj  wears  glasses  while  Xj  does  not’  from 
which  one  can  derive  that  Xj  /  Xj. 

Assume  one  is  able  to  observe  if  the  relation  Xj  -<  Xj  is  true  or  not  for  any  target  Xj  and  Xj  even 
if  one  do  not  know  the  exact  value  of  Xj  and  Xj.  We  show  below  that  this  can  include  significant 
information  for  classification  of  targets. 
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Figure  1:  Known  relations  between  objects  significantly  restricts  the  number  of  candidate  explana¬ 
tions  (sets  of  classifications)  behind  a  set  of  observations.  Here  each  person  illustrates  an  explanation 
for  a  given  set  of  observations.  If  there  are  no  restrictions,  then  four  possible  objects  and  four  obser¬ 
vations  gives  44  =  256  possible  explanations.  If  any  two  observations  come  from  two  different  targets 
(for  example  spatially  distributed  simultaneous  observations),  then  there  are  only  4!  =  24  possible 
candidate  explanations.  This  gives  a  ratio  of  0.094.  This  ratio  decreases  fast  with  increased  number 
of  possible  targets/observations. 

If  we  assume  all  objects  are  different,  and  denote  this  ’relation’  by  7Z.  we  get  by  simple  numerical 
simulations  (see  appendix  B)  that 

P(A i  =  a\0\  =  a,  O2  =  6,  Os  =  c,  O4  =  d,  1Z)  =  0.80  (4) 

which  is  considerably  different  from  P(X  1  =  a\Oi  =  a)  =  0.58.  If  we  have  ground  truth  data  for  an 

object,  for  example  X2  =  b,  we  get  even  more  certainty: 

P{X  1  =  a\0\  =  a,0-2  =  6, O3  =  c,  O4  =  d,  X 2  =  b,1Z)  =  0.88  (5) 

And  if  we  also  have  the  ground  truth  data  A3  =  cwe  get: 

P(Ai  =  o|Oi  =  a,  02  =  6, 03  =  c,  O4  =  d,  X2  =  6,  A3  =  c,  1Z)  =  0.94  (6) 

If  the  set  of  observations  are  not  consistent,  this  will  increase  the  uncertainty.  Assume  we  know 
that  all  observed  targets  are  different,  but  two  observations  give  the  same  independent  classification 
such  as  for  example  03  =  04  =  c.  We  then  get  that 

P(Ai  =  o|Oi  =  a,  02  =  6,03  =  c,04  =  c,K)  =  0.67  (7) 

The  intuitive  explanation  here  is  that  the  inconsistency  (i.e.  that  O3  =  O4)  tells  that  at  least  one  of 

the  observations  O3  and  O4  is  wrong  and  therefore  a  may  be  hidden  behind  one  of  these  observations. 

If  a  is  given  by  the  observations  to  be  two  places  at  the  same  time,  this  further  increases  the 
confusion: 

P(Ai  =  a|Oi  =  a,  0-2  =  6,  O3  =  c,  O4  =  a,  TV)  =  0.50  (8) 

And  if  a  is  given  to  be  3  places  at  the  same  time,  the  uncertainty  even  increase: 

P(Ai  =  a|Oi  =  a,  O2  =  6,  O3  =  a,  O4  =  a,  71)  =  0.33  (9) 
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If  a  is  initially  classified  to  be  at  4  places  at  the  same  time,  then  P{X\  =  a\0\  =  a,  O2  =  a,  O3  = 
a,  O4  =  a,  77)  =  0.25.  These  estimates  conform  to  common  sense  since  inconsistent  observations  have 
to  include  errors. 

These  estimated  probabilities  can  be  used  in  a  cost  assessment  as  shown  in  appendix  A. 2.  The 
cost  assessment  might  be  of  help  to  a  decision  maker  making  a  decision  of  how  to  react  to  these 
observations. 


3  Generalizations  of  limited  example 

3.1  Increased  and  reduced  number  of  observations 


The  limited  example  of  Section  2  assumes  the  number  of  observations  to  be  the  same  as  the  number 
of  targets.  This  section  shows  a  way  to  generalize  this  restriction  allowing  the  number  of  targets 
to  be  different  from  the  number  of  observations.  The  purpose  of  this  generalization  is  to  show  that 
the  amount  of  available  information  -  the  number  of  observations  -  is  important  with  respect  to 
probability  estimates  like  the  ones  in  equations  4-9. 

Assume  77 1  represents  the  relation  that  the  objects  generating  the  observations  0\y . . . ,  O4  are 
different  (as  in  Section  2),  while  the  objects  behind  0\  and  O5  are  the  same  (i.e.  X\  =  X5).  A  similar 
simple  numerical  simulation  as  in  Section  2,  provides  the  following  estimate: 

P{X  1  =  a\0 1  =  a,  O2  =  6,  O3  =  c,  O4  =  d,  O5  =  a,  77i)  =  0.95  (10) 

This  shows  that  the  additional  observation  O 5,  and  the  known  relationship  that  X\  =  Xr,.  gives  an 
improved  confidence  in  the  classification  of  X\  (cf  Equation  4). 

Reduction  of  the  set  of  observations  also  has  effect  on  the  quality  of  classification  in  the  present 
approach.  Let  1Z-2  denote  that  the  objects  generating  the  observations  Oi,  02,03  are  different.  We 
then  get  that 

P(X  1  =  a|Oi  =  a,  O2  =  b ,  O3  =  c,  IZ2)  =  0.69  (11) 

which  is  a  considerable  reduction  of  confidence  in  the  classification  of  X\  (cf  Equation  4). 

3.2  Classifying  a  set  of  not  precisely  known  targets  within  a  restricted  area 


This  section  illustrates  a  generalization  of  the  classification  problem  of  Section  2  which  assumed  a 
small  number  of  known  targets  within  a  restricted  area.  Assume  now  that  the  actual  four  targets 
are  members  of  a  set  11  =  {04,02,  ■  ■  ■ ,  a  1 1 }  of  11  classifications  and  that  the  quality  of  observations 
are  the  same  as  in  the  above  examples.  Equation  12  defines  (similar  to  Equation  1)  the  quality  of 
the  observations: 


P(Oi  =  x\Xi  =  y) 


0.30  if  x  —  y 
0.07  otherwise 


(12) 


where  both  x  and  y  are  an  element  of  fl  and  1  <  i  <  4.  This  conforms  to  the  quality  of  observations 
assumed  in  Section  2  (Equation  1). 

If  there  are  no  known  relations  between  the  targets,  then  the  probability  of  X\  =  a\  given  the 
observation  0\  =  a\  is  P(X  1  =  ai|Oi  =  ai)  =  0.30  (cf  Equation  2). 

Assume,  as  above,  four  observations  Oi  =  at.  i  =  1,  2,3,4  and  that  all  objects  are  different  (77) . 
We  get  in  this  case  that  (still  via  simple  simulations): 


P(Xi  =  a\\Oi  =  Oj,77)  =  0.32  *  =  1,2,  3, 4 


(13) 


16-4 


RTO-MP-IST-040 


Set  Classification  of  Military  Targets 

where  1Z  as  above  denotes  that  all  objects  X\,. . .  .X4  are  different.  As  opposed  to  in  the  classification 
problem  of  section  2  (see  equation  4),  the  information  (7Z)  that  the  targets  are  different  do  not 
significantly  decrease  the  uncertainty  of  classification  (note  from  above  that  P(X  1  =  U[\Oi  =  a\)  = 
0.30). 

Assume  now,  as  an  illustration,  that  the  relative  sizes  of  the  4  targets  constitutes  a  strong  linear 
relation  -<  on  the  set  0  of  4  targets.  It  might  e.g.  be  known  that  object  1  is  smaller  than  object  2, 
which  is  smaller  than  object  3  etc.  (i.e.  as  if  the  elements  of  If  were  logically  organized  along  a  line). 

This  means,  mathematically,  that  for  any  two  different  objects  X,  Y  in  0,  either  X  Y  or 
Y  X.  It  also  means  that  X  -<Y  and  Y  Z  gives  that  X  -4  Z  (and  it  is  never  true  that  X  -4  X). 

If  we  know,  for  example,  that  X\  is  ’less  than’  another  observed  object  X2  (i.e.  X\  -4  X2),  then  we 
get: 

P(Xi  =oi|Xi  -4X2)  =0.56  (14) 

Similarly  we  have  that 

P(Xi  =oi|Xi  -<X2 -<X3)  =  0.76  (15) 

and  if  all  the  four  observed  objects  are  strongly  linearly  related: 

P(Xi  =ai|Xi  ^X2  ^X3  -ZX4)  =0.90  (16) 


4  Initial  probabilities 

The  above  statistical  approach  assumes  a  homogeneous  distribution  of  the  initial  probabilities  over 
the  set  of  possible  states.  This  is  a  natural  choice  when  there  is  no  information  giving  preference 
to  special  alternatives.  The  use  of  the  relation  1Z  above  can  be  looked  at  as  defining  an  initial 
probability  where  the  states  not  conforming  to  the  relation  has  probability  zero  (and  the  other  states 
each  having  the  same  initial  probability).  A  certain  relation,  in  other  words,  restricts  the  search 
space  or  concentrates  the  probability  to  the  subspace  conforming  to  the  relation.  Other  types  of 
initial  distributions  can  have  similar  effects  on  the  classifications  similar  to  those  above. 

5  Treatment  of  many  objects 

5.1  Restricting  the  search 

The  examples  above  assumes,  for  a  large  number  of  objects,  heavy  computing  and  the  method  is 
therefore  not  directly  practical  for  large  problems.  A  main  challenge  is  therefore  meaningfully  to 
search  through  only  a  smaller  part  of  the  state  space  or  to  search  through  it  in  a  cost  effective 
manner  and  eventually  to  stop  processing  when  sufficient  results  are  obtained.  An  algorithm  can 
use  knowledge  about  physical  restrictions  and  information  from  intelligence  in  order  to  reduce  the 
search  space.  Situation  and  threat  assessments  can  also  provide  information  for  restricting  the  search 
space  for  finding  sufficient  information  for  actual  decisions.  This  approach  for  efficient  search  requires 
flexible  user  interactions. 


5.2  Ordered  and  restricted  search 

Assume  a  set  o  =  {Oi,  02, . . . ,  Om}  of  M  independent  observations  of  singular  members  of  a  large 
set  {ai,a2 . . . ,  a;Y }  of  N  targets  within  a  restricted  area  (say  N  —  20).  Xt  denotes  the  true  value 
behind  the  observation  Oj,  and  an  observer  with  incomplete  information  about  it,  may  treat  Xt  as  a 
random  variable. 
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Assume  that  each  state  x  =  {Xi, . . . ,  Am),  has  an  associated  ” interest  to  know”.  I.e.  there  is  a 
payoff  (or  saved  cost)  associated  with  information  about  the  values  of  A,.  One  may  therefore  want  to 
estimate  the  probability  distribution  for  x  or  at  least  obtain  partly  knowledge  about  its  probability 
distribution. 

An  efficient  search  after  estimates  of  the  probability  of  ’’interesting”  states,  can  utilize  an  initial 
probability  distribution  on  the  state  space.  The  search  can  in  this  way  first  look  through  parts  of  the 
search  space  with  the  highest  expected  payoff.  Let  A  and  B  be  two  different  parts  of  the  search  space. 
Given  that  J(x  6  A)  and  I(x  6  B)  are  the  payoffs  to  know  if  x  6  A  and  x  6  B,  respectively,  and 
that  x  e  A  and  x  6  B  similarly  has  initial  (unconditional)  probabilities  P(x  €  A)  and  P(x  €  B). 
If  a  decision  component  must  make  a  priority  between  calculating  P(x  €  A|o)  or  P(x  6  B|o)  (for 
example  within  a  given  time),  then  estimating  P(x  6  A|o)  would  be  a  natural  choice  if  the  expected 
payoff  (7(A)  =  J(x  6  A)  •  P(x  6  A)  is  significantly  larger  than  (7(B)  =  J(x  €  B)  •  P(x  6  B). 
If  the  payoff  from  estimates  for  x  6  B  is  expected  to  be  much  larger  than  the  cost  of  waiting  for 
the  estimates,  then  a  natural  choice  would  be  to  wait  for  estimates  for  x  €  B.  If  I(x  €  A)  =  0  or 
P(x  6  A)  =  0,  then  the  part  A  may  not  be  interesting  to  search  through. 
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Appendix 
A  Scenario 

A.l  Intelligence  operation  within  a  restricted  area 

The  following  scenario  illustrates  the  use  of  the  present  approach  to  help  establishing  a  situation 
picture.  An  unknown  (potentially  hostile)  group  of  four  boats  (a,  6,  c,  d )  is  observed  entering  a 
Norwegian  fjord.  Intelligence  and  general  situation  assessment  results  in  that  a  team  is  set  up  with 
the  mission  to  monitor  the  boats.  The  monitoring  is  to  be  conducted  in  a  discrete/hidden  manner  to 
prevent  revealing  the  level  of  monitoring,  hence  helicopters,  radars  etc.  cannot  be  used.  Revealing, 
for  an  enemy,  the  level  of  monitoring  and  response  patterns  is  regarded  to  increase  to  vulnerability  in 
the  given  situation  and  it  will  also  increase  the  need  for  costly  change  in  security  related  investments. 
There  are  several  hypotheses  for  why  the  boats  have  entered  the  fjord: 

HI:  They  are  regular  tourist  boats. 

H2:  At  least  one  boat  (boat  a)  is  testing  our  intelligence,  level  of  alert  and  response  patterns. 

H3:  The  boat  a  is  for  sabotaging  a  power  plant  in  the  fjord. 

The  intelligence  team  is  assumed  to  report  if  the  power  plant  is  at  risk  (early  enough  to  protect  this 
possible  target). 
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The  outlet  of  the  fjord  is  well  observed,  meaning  that  it  is  known  how  many  boats  have  entered 
the  fjord  and  that  any  boats  coming  into  the  fjord  will  be  spotted.  The  boats  can  be  distinguished 
from  each  other  via  different  characteristics  (different  sizes,  different  types  of  boats,  etc.).  Further, 
one  can  easily  find  out  if  two  observations  of  boats  are  for  different  boats  or  not.  Each  observation 
is  given  a  confidence  as  in  Equation  1.  The  mission  for  the  team  is  (in  a  hidden  way)  to  monitor  all 
four  boats  as  well  as  possible  while  evaluating  the  risk  for  sabotage. 

A. 2  Cost  Assessment 

The  power  plant  is  assumed  to  have  a  (relative)  value  of  ip0wer  piant  =  100.  The  cost  of  revealing  the 
intelligence/monitoring  level  is  assumed  to  be  less  than  the  limit  5  =  5.  Only  one  of  the  boats  (boat 

а)  is  assumed  to  be  large  enough  to  be  equipped  for  attacking  the  power  plant.  If  the  boat  a  enters 
a  ’’red  zone”  close  to  the  power  plant,  it  is  given  a  probability  pattack  1°  attack  the  power  plant. 
Assume,  according  to  Section  2  above,  the  observations  o  =  {0\  =  a,  O2  =  &,  O3  =  c,  O4  =  d}.  So  if 
a  boat  X\  is  observed  within  the  ’’red  zone”,  and  the  classification  only  includes  use  of  observation 
0\ ,  the  expected  cost  of  the  choice  of  not  taking  action  would  be: 

^no  action  ~  ^attack  '  -^(^l  —  a\0\  =  a)  ■  -Ip0Wer  plant  (1^) 

For  pattack  =  0-10,  P(X  1  =  a|Oi  =  a)  =  0.58,  and  Ip0wer  plant  =  100  this  gives  C  =  5.8  which  is 
close  to  the  cost  of  revealing  the  intelligence.  Hence  the  individual  classification  estimate  P(X  1  = 
a|Oi  =  a)  =  0.58  does  not  provide  a  clear  conclusion  (decision).  The  similar  expected  cost,  taking 
into  account  the  whole  set  of  observations,  would  be 

O'no  action  —  ^attack  ' 

P{Xi  =  o|o,  7V)  ■  -Ip0wer  plant  (1^) 

giving,  according  to  Equation  4,  C  =  8.  If  there  were  ground  truth  data  for  X2  and  A3  (cf  Equation 

б) ,  then  we  would  get  C  =  9.4. 

Assume  (still)  that  boat  a  is  known  to  be  hostile.  If  there  is  no  significant  cost  associated  with 
checking  up  a  regular  tourist  boat,  then  the  expected  cost  of  taking  action  similarly  would  be  less 
than 

^action  =  -5  •  P(X1  =  a\01  =  a)  =  2.9  (19) 

and 

^action  =  5  '  P(X  1  =  o|o,  71)  =  4.0.  (20) 

If  there  were  a  high  cost  associated  with  the  possibility  that  Xi  was  a  regular  tourist  boat,  and 
6,  c  and  d  were  confirmed  to  be  tourist  boats,  then  there  would  be  an  additional  cost  proportional 
to  P(Xi  /  a|Oi  =  a)  =  0.42,  P(Xi  ^  a\o, 7Z)  =  0.20  or  alternatively  P{X\  /  a|o, X-2  =  b,X 3  = 
c,  7Z)  =  0.06.  These  alternative  estimates  for  the  probability  for  Xi  =  a  could  in  this  case  give  large 
differences  in  cost  estimates. 

B  Numerical  simulation  algorithm 

This  section  describes  the  simulations  providing  the  estimates  of  classification  probabilities  used  in 
this  paper.  The  following  points  outline  the  algorithm: 

1.  A  set  of  all  possible  realities  explaining  the  original  observation  vector  (explanations)  is  gener¬ 
ated.  The  size  of  this  set  depends  on  the  size  of  the  original  observation  vector  (the  number 
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ORGANIZATION 


of  observations),  the  number  of  possible  classifications  of  the  objects  generating  the  original 
observation  vector,  and  on  the  possible  restrictions.  E.g.:  with  an  original  observation  vector 
of  5  single  observations,  4  possible  classifications,  and  no  restrictions,  the  set  of  explanations  is 
a  matrix  of  size  54  x  5.  With  an  original  observation  vector  of  size  4,  4  possible  classifications 
and  the  restriction  that  the  4  items  in  the  observation  vector  originate  from  different  objects, 
the  size  of  the  matrix  will  be  4!  x  4. 

2.  For  each  explanation,  1000  observation  vectors  of  the  same  size  as  the  original  observation  vector 
are  generated.  Each  of  these  observation  vectors  are  generated  randomly  based  on  the  sensor 
likelihood,  i.e.:  for  each  entry  in  the  explanation,  a  random  number  is  drawn  and  compared  to 
the  likelihood.  The  outcome  of  this  comparison  decides  what  observation  is  generated  by  this 
particular  explanation. 

3.  The  fraction  of  these  1000  observation  vectors  equal  to  the  original  observation  vector,  repre¬ 
sents  how  well  the  possible  explanation  explains  the  original  observation. 

This  part  of  the  algorithm  is  repeated  N  times,  and  the  mean  fraction  for  each  possible  explanation 
is  recorded  as  the  final  result. 

Consider,  as  an  example,  an  original  observation  vector  oov  =  [  a  b  c  j  ,  where  the  possible 
classifications  are  a,  b  and  c.  With  the  restriction  that  the  3  items  in  oov  originate  from  different 
objects,  the  matrix  of  explanations  will  look  like  this: 

'a  b  c  ' 
a  c  b 
b  a  c 
b  c  a 
cab 
c  b  a 

The  algorithm  then  could  produce  the  following  result: 


possible  explanation 

result 

fraction 

[a  b  c] 

750 

0.75 

[a  c  b] 

130 

0.13 

[b  a  c  ] 

40 

0.04 

[b  c  a] 

20 

0.02 

[cab] 

20 

0.02 

[  c  b  a] 

40 

0.04 

This  result  gives  the  estimate 

P{X1  =  a\Oi  =  a,  02  =  6,  O3  =  c,  W3)  = 

P{X  1  =  a,  X2  =  b,X3  =  c\Oi  =  a, O2  =  b,03  =  c,773)  + 
P{X1  =  a,X2  =  c,X3  =  b\Oi  =  a,  02  =  6, 03  =  c,W3)  = 
0.75  +  0.13  =  0.88 

where  773  represents  the  restriction  that  X\.  X2  and  X3  are  different. 
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Elaboration  of  possible  consistent 
explanations  for  a  set  of  observations  by 
using  available  information. 
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Issues  for  this  talk: 

.Show  examples  of  how  classification  can  be  affected  by 
intelligence  reports  from  a  situation. 

.To  explore  potentials  for  better  formalized  use  of  available 
information  within  classification. 

.Show  generalizations  of  simple  model  study. 

.Illustrate  the  potential  amount  of  information  available  in  a 
set  of  observations  of  targets  where  there  are  known 
relations  between  them. 
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The  set  of  possible  explanations  is  large  - 
hence  a  need  for  reductions 


Known  relations  between 
objects  (and  other  facts)  restrict 
the  number  of  possible 
explanations  for  a  set  of 
observations. 

Relations  between  targets  (also 
uncertain  relations)  can  in 
general  affect  preferences 
between  explanations  (so  a 
relation  may  be  given  with  some 
certainty). 
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Examples  of  relations  between  targets 
(useful  for  classification): 


• — 


Some  targets  are  different  from 
each  other. 

Some  targets  are  larger,  higher, 
longer  than  others. 

Targets  may  have  different 
attributes. 

Some  targets  give  more  radar 
return  than  others. 

Some  targets  are  in  front  of 
others. 
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Example 


.  Four  targets  (boats)  in  a  restricted  area  (Norwegian  fjord). 

.  Four  observations  where  observers  classify  an  object  from 
a  list  of  these  objects. 

.  Likelihood  function:  Given  an  observed  object  ”a”,  then 
there  is  58  percent  probability  it  is  classified  as  ”a”  (and  14 
percent  for  the  other  three). 


16-5 


ill* 


Forsvarets 

forskningsinstitutt 


Example  continued: 

Treatment  of  observations 


.  l/l/e  assume  observations  0  =  {01  -a,  02=b,  03=c,  04-d}  and 
targets  X  =  {XI,  X2,  X3,  X4}  which  we  want  to  classify  (i.e.  we 
want  to  estimate  the  probability  distribution  of  XI,  X2,  X3,  X4 
given  available  information). 

.  Isolated  assessment:  P(X1=a\0=a)  =  0.58  (where  XI  denotes 
the  target  for  observation  01). 

.  Assuming  all  four  targets  different  gives:  P(X1=a\  “ all  targets 
different ’)  =  0.8. 

.  If,  in  addition,  X2-b  is  confirmed,  P(X1-a\  conditions)  =  0.94). 
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Example  2 


.1 1  objects  {a,b . k}. 

.4  observations  {01=a,02=b,03=c,04=d}. 

.Likelihood  function:  P(01=a|X1=a)  =  0.30  and  P(01=a|X1=b)  = 
0.07,  P(02=a|X1=c)  =  0.07  etc. 

.  P(X1  =  a  |  01  =a,02=b,03=c,04=d)  =  0.32. 

.Assuming  the  targets  can  be  linearly  ordered: 

P(X1  =a|X1  <X2<X3<X4J  =  0.90 
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Treatment  of  many  objects 


.  Reduce  search  space. 

.  Ordered  search. 

.  Focus  on  important/critical  information  for  decisions  (for 
example  is  P(a  \  conditions)  >  x  ? 

.  Split  up  the  search  space  (in  independent  parts). 

.  Combine  ’’good  aspects” /  creative  search:  evolution. 

.  Threat  analysis:  Paranoid  search. 


